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Abstract 

We study the relations between stellar mass, star formation history, size and internal 
structure for a complete sample of 122,808 galaxies drawn from the Sloan Digital Sky 
Survey. We show that low-redshift galaxies divide into two distinct families at a stellar 
mass of 3 x 1O 1O M . Lower mass galaxies have young stellar populations, low surface 
mass densities, and the low concentrations typical of disks. Their star formation histo- 
ries are more strongly correlated with surface mass density than with stellar mass. A 
significant fraction of the lowest mass galaxies in our sample have experienced recent 
starbursts. At given stellar mass, the sizes of low mass galaxies are log-normally dis- 
tributed with dispersion cr(lni?5o) ~ 0.5, in excellent agreement with the idea that they 
form with little angular momentum loss through cooling and condensation in a gravi- 
tationally dominant dark matter halo. Their median stellar surface mass density scales 
with stellar mass as oc Mj? , suggesting that the stellar mass of a disk galaxy is pro- 
portional to the three halves power of its halo mass. All this suggests that the efficiency 
of the conversion of baryons into stars in low mass galaxies increases in proportion to 
halo mass, perhaps as a result of supernova feedback processes. At stellar masses above 
3 x 1O 1O M0, there is a rapidly increasing fraction of galaxies with old stellar populations, 
high surface mass densities and the high concentrations typical of bulges. In this regime, 
the size distribution remains log-normal, but its dispersion decreases rapidly with in- 
creasing mass and the median stellar mass surface density is approximately constant. 
This suggests that the star formation efficiency decreases in the highest mass halos, and 
that little star formation occurs in massive galaxies after they have assembled. 



Keywords: galaxies: for mat ion, evolution; galaxies: stellar content 



1 Introduction 



Perhaps the most remarkable aspect of galaxies is their regularity - the fact that they can be 
classified into well-defined sequences. Hubble (1926) introduced the first scheme to categorize 
galaxies according to morphological type, and its basic concepts are still in use today. In 
its simplest form, three basic galaxy types are recognized: ellipticals, spirals and irregulars. 
These can be arranged in a linear sequence along which many properties vary coherently. The 
most striking trends are the variations in morphology (featureless ellipsoid to spiral disk to 
irregular appearance) and in colour and star formation activity. These trends were so striking 
that they formed the basis of Hubble's original scheme. Elliptical galaxies have red colours, 
little gas and no ongoing star formation; irregulars have blue colours, large gas fractions and 
are actively forming stars. An up-to-date review of star formation along the Hubble sequence 
is given in Kennicutt (1998). 

It should be remembered that Hubble introduced his galaxy classification scheme at a 
time when distance estimates were available for only a handful of galaxies. Hubble could not 
study the properties of galaxies as a function of parameters that measure their absolute scale: 
sizes, absolute magnitudes and masses. Today it is known that the distribution of Hubble 
types depends strongly on galaxy luminosity. The typical elliptical galaxy is a factor ~ 10 
more luminous in the B-band than the typical irregular galaxy (Roberts & Haynes 1994). As 
a result, the faint end of the galaxy luminosity function is dominated by late-type galaxies 
with strong emission lines, the bright end by early-type galaxies with little star-formation 
(e.g. Loveday et al 1992; Marzke et al 1994; Lin et al 1996; Zucca et al 1997; Marzke et al 
1998; Christlein 2000). These trends in star formation activity become even more pronounced 
when studied in the near-infrared. This has led to suggestions that the ratio of present to 
past-averaged star formation rate may depend primarily on stellar mass and only secondarily 
on morphological type (Boselli et al 2001; Balogh et al 2001). 

There have been many studies of stellar populations and star formation histories that focus 
on galaxies of a particular Hubble type. The star formation rates in most irregular and dwarf 
galaxies in the Local Group do not appear to have varied smoothly with time, but instead 
have undergone large fluctuations or "bursts" (see Grebel 2000 for a recent review). There is 
evidence that the recent star formation histories of spirals depend on the surface brightness of 
their disks. Low surface brightness galaxies appear to have bluer colours and higher gas mass 
fractions than high surface brightness galaxies (e.g. de Blok, van der Hulst & Bothun 1995; 
de Blok, McGaugh & van der Hulst 1996; Bell & de Jong 2000). The situation for ellipticals is 
currently unclear. Although there is considerable evidence that the absorption line strengths 
of early-type galaxies vary systematically with their velocity dispersions, there is much debate 
as to whether this reflects trends in stellar age, in metallicity or in the relative abundance of 
different heavy elements (e.g. Trager et al 1998; Kuntschner et al 2001). 

The Sloan Digital Sky Survey will obtain u, g, r, i, and z-band photometry, spectra and 
redshifts for at least 700,000 galaxies down to a limiting r-band magnitude of 17.77. This is 
the most ambitious survey of the local galaxy population ever undertaken and motivates a 
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critical re-evaluation of trends in the star formation histories of galaxies as a function of mass, 
size, internal structure and environment. 

We have recently developed a new method to constrain the star formation history, and to 
estimate the dust attenuation and stellar mass of galaxies (Kauffmann et al 2002; hereafter 
Paper I). It is based on two stellar absorption line indices, the 4000 A break strength and 
the Balmer absorption line index H^. These indices allow us to constrain the mean stellar 
age of a galaxy and the fraction of its stellar mass formed in bursts over the past few Gyr. 
A comparison with broad band magnitudes then yields estimates of dust attenuation and of 
stellar mass. We have generated a large library of Monte Carlo realizations of different star 
formation histories, including starbursts of varying strength. We have used this library to 
generate likelihood distributions for parameters such as burst mass fraction, dust attenuation 
strength, stellar mass and stellar mass-to-light ratio for every galaxy in a complete sample of 
122,808 objects drawn from the Sloan Digital Sky Survey. 

Paper I introduced our method and used it to measure how the total stellar mass of the 
Universe is distributed over galaxies as a function of their stellar mass, size, concentration, 
colour and surface mass density. In this paper, we focus on the mass dependence of the star 
formation histories, sizes and structural parameters of galaxies. We study to what extent the 
recent star formation histories of galaxies of different stellar mass can be characterized as con- 
tinuous or as burst-dominated. We also show how the distributions of structural parameters 
(size, surface mass density, concentration index) vary with stellar mass, and we demonstrate 
that variations in star formation history and in structural parameters are tightly coupled. 
Finally, we interpret our findings in the context of current theories of galaxy formation. 

2 Review of the Observed and Derived Parameters of 
Galaxies 

The sample of galaxies analyzed in this paper is drawn from the Sloan Digital Sky Survey 
(York et al 2000; Stoughton et al 2002). We have included all galaxies with spectroscopic 
observations in the Data Release One (DR1) and with Petrosian r band magnitudes in the 
range 14.5 < r* < 17.77 after correction for foreground galactic extinction. Details about 
the spectroscopic target selection may be found in Strauss et al (2002). These data represent 
roughly 20% of the projected survey total and include a total of 122,808 galaxies. 

The reader is referred to Paper I for more information about the sample and a detailed 
description of the methods used to derive parameters such as stellar mass and burst mass 
fraction. We also discuss the uncertainties in our methods in Paper I. Here we provide a brief 
summary of the quantities that are studied in this paper: 

1. Absolute magnitudes: All magnitudes quoted in this paper are Petrosian magni- 
tudes. In order to convert from apparent to absolute magnitude, we assume a Friedman- 
Robertson- Walker cosmology with f2 = 0.3, A = 0.7 and H = 70 km s _1 Mpc -1 . We 
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have calculated the K-corrections K(z) for each galaxy using the routines in kcorrect 
vl_ll (Blanton et al. 2002). In order to minimize the errors in this procedure, we 
K-correct the magnitudes of all galaxies in our sample to z = 0.1 More details about 
the SDSS photometric system may be found in Fukugita et al (1996) and Smith et al 
(2002). Information relating to the SDSS camera and photometric monitoring system 
can be found in Gunn et al (1998) and Hogg et al (2001). A technical paper describing 
the astrometry in the survey has recently been published by Pier et al (2002). 

2. Stellar spectral indices: We have adopted the narrow definition of the 4000 A break 
introduced by Balogh et al (1999), which we denote D„(4000). We also use the Balmer 
absorption index H5a as defined by Worthey & Ottaviani (1997) using a central band- 
pass bracketed by two continuum bandpasses. The evolution of the two indices have 
been calibrated using a new population synthesis code that incorporates high resolution 
stellar libraries (Bruzual & Chariot 2002). Our measurements of H5a are corrected for 
contamination due to nebular emission. As discussed in Paper I, the D n (4000) index is 
an excellent age indicator for young (< 1 Gyr, D n (4000)< 1.5) stellar populations, but 
for older stellar populations, the index depends quite strongly on metallicity. Strong 
Balmer absorption occurs in galaxies that experienced a burst of star formation 0.1-1 
Gyr ago. Analyzed together, the two indices are a powerful probe of the recent star for- 
mation history of a galaxy. Because each is defined over a narrow wavelength interval, 
they are not sensitive to dust attenuation effects. 

3. Stellar masses: The stellar masses are derived assuming a universal initial mass func- 
tion (IMF) in the parametrisation of Kroupa (2001). The typical 95% confidence range 
in our estimated stellar masses is ± 40 %. 

4. Burst mass fractions: We define the parameter -F 6ursi as the fraction of the total 
stellar mass of a galaxy that formed in "burst" mode over the past 2 Gyr. In Paper I 
we showed that galaxies with continuous star formation histories occupy a very narrow 
locus in the H<5a/D„(4000) plane. Galaxies that experienced recent bursts have B.5a 
values that are displaced from this locus. Because the typical observational error on the 
H5a index is large (1.4 A ) in our sample, the median value of the likelihood distribution 
of F burst is sensitive to the mix of star formation histories in our model library, which 
functions as a Bayesian prior for our analysis. As a result, we focus below on galaxies 
for which our analysis implies F burst > with high confidence (97.5% for our standard 
prior). The tests described in Paper I showed the resulting sample to be insensitive to 
the actual prior adopted. 

5. Galaxy sizes: We study the distribution functions of R50(z), the radius enclosing 50% 
of the Petrosian z-band luminosity of a galaxy. We note that the sizes output by the 
current SDSS photometric pipeline have not been corrected for seeing effects (Stoughton 
et al 2001). 
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6. Concentration indices: In order to be consistent with previous work (Blanton et al 
2001; Strateva et al 2001; Shimasaku et al 2001) we define the concentration index C 
as the ratio R90/R50, where R90 and R50 are the radii enclosing 90% and 50% of the 
Petrosian r-band luminosity of the galaxy. It has been shown by Shimasaku et al (2001) 
and Strateva et al (2001) that for bright galaxies, there is a good correspondence between 
concentration parameter and 'by-eye' classification into Hubble type, with C ~ 2.6 
marking the boundary between early and late type galaxies. 

7. Surface brightnesses and surface mass densities: Following Blanton et al (2001), 
we define the half-light surface brightness /x50 to be the average r-band surface brightness 
within the r-band half-light radius R50 in magnitudes per square arcsecond. The main 
spectroscopic survey is complete down to a limiting half-light surface brightness /x50 = 
24.5 mag arcsec -2 . We define the surface mass density //* as 0.5M*/[7r.R50 2 (,2:)], where 
R50(z) is the Petrosian half-light radius in the z-band (we choose a radius defined in 
the z-band rather than r-band, as this provides a better approximation to the radius 
enclosing half the total stellar mass). 

In this paper, we will often use bivariate density distributions <f>(x, y) in the parameters x 
and y, defined such that <f>(x, y)dx dy is the number of galaxies per unit volume with x in the 
interval (x, x + dx) and y in the interval (y,y + dy). Such bivariate distributions are calculated 
by weighting each galaxy in the sample by 1/V max , where V max is the volume corresponding 
to the total redshift range over which the galaxy would pass our sample selection criteria. 
Because our sample is extremely large, errors due to large scale structure are expected to be 
small. In many cases we will be interested in trends in the distribution of the parameter y 
as a function of the parameter x (for example in the distribution of concentration or size as 
a function of stellar mass). In such cases, we make the trends more visible by plotting the 
conditional distribution of y given x, i.e. <j)(x,y)/ J <p(x,y')dy' . 

3 The Observed Correlations 

3.1 The Dependence of Star Formation History on Stellar Mass 

In Fig. 1, we present the conditional density distributions of our two stellar age indicators 
D n (4000) and as a function of stellar mass and as a function of (/-band absolute magnitude. 
The grey-scale indicates the fraction of galaxies in a given logarithmic mass (or magnitude) bin 
that fall into each age-indicator bin. The contours are separated by factors of 2 in population 
density. 

It is clear that D n (4000) and depend strongly on both stellar mass and absolute 
magnitude. The main effect of transforming from luminosity to mass is to produce a more 
regular variation of the indices. Fig. 1 shows that low mass galaxies are 'young'- they 
typically have low values of D n (4000) and high values of H^. At a mass of M, ~ 3 x 10 10 
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M , a transition towards older stellar populations begins to take effect. Almost all of the 
most massive galaxies in our sample have high values of D„(4000) and low values of H5a- 

Fig. 2 shows "slices" through the D n (4000)-M* distribution. Recall that the measurement 
error on the D n (4000) index is small, typically around 0.04, or a few percent of the total range 
of values spanned by galaxies in our sample. Again, one sees a striking trend towards older 
stellar populations for galaxies with larger stellar masses. Both in Fig. 1 and in these plots 
one gets the impression of two separate and relatively well-defined populations. The relative 
weight of the 'old' population increases strongly at stellar masses above ~ 1O 1O M . 

The bivariate density distribution of H5a and D n (4000) is shown in Fig. 3 for galaxies in 8 
ranges of stellar mass spanning four orders of magnitude from = 1O 8 M to M* = 1O 12 M . 
In these plots, the grey scale indicates the fraction of galaxies that fall into each bin of HSa 
and D n (4000). It is striking how galaxies move diagonally across the H5 J 4/D n (4000) plane as 
their masses increase. 

From Fig. 3, one can see clearly that low mass galaxies with D n (4000) < 1.5 (characteristic 
of stellar populations with mean ages of less than a few Gyr) have a larger scatter in R5a 
equivalent width than high mass galaxies with similar break strengths. This means that the 
fraction of low mass galaxies that have experienced recent bursts is higher than that for high 
mass galaxies, even when the two populations are compared at similar mean stellar age. We 
illustrate this in detail in Fig.4, where we plot the median value of RSa as a function of 
D n (4000) for low mass and for high mass galaxies. The solid and dotted errorbars indicate 
the 25th to 75th and 5th to 95th percentile ranges of the distribution of H5a values. For 
comparison, we have overplotted 200 different model galaxies with continuous star formation 
histories. The models span a wide range in exponential timescale (r = 0.1 to 10 Gyr) and 
formation redshift (zf orm = 20 to 0.2). The reader is referred to Paper I for more details 
about the models. Solar metallicity models are plotted in the right hand panel. In the left 
hand panel, we have plotted 20% solar models, which are more appropriate for galaxies with 
masses between 10 8 and 10 9 M (Tremonti et al 2003) For massive galaxies , the data agree 
extremely well with the model predictions. The scatter in B.5a at given D ra (4000) is consistent 
with that expected from observational errors. Note that although we plot the distribution of 
H5a down to very low D n (4000) values, Fig. 2 shows that less than 5% of galaxies with stellar 
masses in the range 10 10 — 1O 11 M have values D n (4000) too low to be consistent with any of 
our continuous models. For low mass galaxies, this is no longer true. Fig. 2 shows that ~ 30% 
of all galaxies have D Tt (4000)< 1.2. As discussed in Paper I, many of these galaxies are likely 
to be experiencing a starburst at the present day. Fig. 4 shows that for low mass galaxies, 
the data are also displaced upwards relative to the predictions of the continuous models and 
the scatter is too large to be consistent with observational errors. As discussed in Paper I, 
galaxies with moderate values of D n (4000) and very high values of US a are likely to be in a 
post-starburst phase. 

In Paper I, we introduced models in which galaxies formed stars in two different modes: 
i) a "continuous" mode , which we parametrized by an exponential law with timescale r and 
starting time tf orm , and ii) a superposed "burst" mode. A burst was defined to be an episode 
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of star formation lasting between 3 x 10 7 and 3 x 10 8 years in which a fraction F burst of the 
total stellar mass of the galaxy was formed. In Paper I, we also introduced a method that 
allowed us to calculate the a posteriori likelihood distribution of Fb urst for each galaxy in 
our sample, given its observed absorption line indices and the measurement errors on these 
indices. In Fig. 5, we plot the distribution of galaxies as a function of Fburst for 8 different 
mass ranges. The distribution of the median value of Fburst (which we denote Fb urst (50%)) for 
all galaxies in the given mass range is plotted as a solid histogram. We define a subsample 
of high-confidence bursty galaxies as those objects with Ft, urst (2.5%) > 0, where Ff, urst (2.5%) 
is the lower 2.5 percentile point of the likelihood distribution of Fb urst . The distribution of 
Fburst(50%) for this subsample is plotted as a dotted histogram in Fig. 5. 

In our lowest mass range (10 8 — 3 x 1O 8 M ), we find that half of all galaxies have 
-Pfeurst(50%) > 0. However, it is only for 10% of these objects that we can state with high 
(>97.5%) confidence that a burst did occur in the recent past. The fraction of galaxies that 
have experienced recent bursts decreases very strongly with increasing stellar mass. This 
trend is apparent both in the fraction of galaxies with Ff, ursi (50%) > and in the fraction 
with Fb ur ,;t (2. 5%) > 0. There are very few high confidence bursty galaxies in our high mass 
bins. Out of our entire sample of 120,000 galaxies, we only pick out around 500 galaxies 
with masses comparable to that of the Milky Way and with Ff, urst (2.5%) > 0. For our full 
sample, the median value of the burst mass fraction correlates with stellar mass. The same 
is not true for the high confidence bursty galaxies - these systems have burst mass fractions 
that are independent of stellar mass. Note that our requirement that a burst be detected 
at high confidence will, by definition, bias this subsample towards high burst fractions. The 
trend in burst mass fraction for our full sample is in some sense expected, because it is well 
known that more massive galaxies have lower gas mass fractions and thus contain less fuel 
for star formation (e.g. Roberts & Haynes 1994; Boselli et al 2001; Bell & de Jong 2001). 
What about the influence of our adopted prior? The analysis in Paper I showed that decreas- 
ing the number of bursty galaxies in the model library resulted in lower estimated values of 
Fburst{50%) . Our standard prior assumes that the fraction of bursty galaxies is constant, in 
apparent contradiction with the results shown in Fig. 5. Our choice of prior would there- 
fore tend to weaken, rather than strengthen, any true decrease in F burs t towards high masses. 
Paper I also demonstrated that the definition and inferred properties of our high confidence 
subset of bursty galaxies in insensitive to the adopted prior. 

In Figs. 6 and 7 we compare the distributions of Ha emission line equivalent widths for 
galaxies with i^ urst (2.5%) > and for galaxies with Fb urs t{50%) = 0. In Fig. 6, we have 
subdivided the two samples into different bins in D„(4000) in order to take into account any 
systematic mean age difference between galaxies with and without recent bursts. For galaxies 
with low values of D n (4000), there is a tendency for bursty galaxies to have stronger Ha 
emission than non-bursty galaxies with similar break strengths. These are galaxies whose 
present-day star formation rates are enhanced over other systems of similar mean stellar age 
(recall that the Ha emission is coming from stars less than 10 7 years in age). For galaxies with 
high values of D„(4000), the trend is reversed. These high D„(4000) galaxies are presumably 
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"post-starburst" systems that experienced a strong burst some time in the past, but have 
subsequently stopped forming any stars. In Fig. 7, we compare galaxies with F burst {2.5%) > 
and galaxies with Fb urst (50%) = in fixed bins of stellar mass. Low mass bursty galaxies are 
skewed to higher Ha equivalent widths, while high mass bursty galaxies exhibit a wider range 
in equivalent widths than their counterparts with Fb urst (50%) = 0. Because information about 
Ha was not used when selecting our sample of bursty galaxies, we regard these differences in 
emission line properties as evidence that our method does pick out a sample of galaxies with 
very strong variations in recent star formation history. 



3.2 The Dependence of Structural Parameters on Stellar Mass 

In Fig. 8, we present conditional density distributions for concentration index C, r-band 
half-light surface brightness /x50, and surface mass density /x* as functions of stellar mass and 
of r-band absolute magnitude. Fig. 8 demonstrates that the structural parameters of galaxies 
correlate both with absolute magnitude and with stellar mass, but once again we find that 
the trends are smoother when plotted as a function of stellar mass. 

The surface mass density exhibits a strikingly tight correlation with stellar mass. 
increases by nearly two orders of magnitude from ~ 1O 7 M kpc -2 for galaxies with M* ~ 
10 8 M Q to ~ 1O 9 M kpc~ 2 for the most massive galaxies with M* = 1O 12 M . In contrast, the 
r-band surface brightness only increases by a factor of 4 as M(r) increases by 8 magnitudes. 
There is a sharp change in slope in the /i* — M* relation at a stellar mass of ~ 3 x 10 10 M Q . A 
transition at the same stellar mass is also seen in the C — M* relation. Interestingly, this is also 
the stellar mass at which galaxies switch from low D n (4000) to high D n (4000) in Fig. 1. It is 
important to note that virtually all galaxies in our sample with M* > 1O 8 M have /ii/2 < 23.5 
mag arcsec -2 . The main spectroscopic survey is complete to surface brightnesses a magnitude 
fainter than this, so our results should not be biased by surface brightness selection effects. 

Fig. 9 presents the size distributions for galaxies in eight disjoint ranges of stellar mass. 
We plot the fraction of galaxies as a function of the natural logarithm of the half-light radius 
in the z-band. These size distributions are extremely well described by a log-normal function 
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where R me d is the median value of the size distribution and a is the dispersion in \nR. We 
have indicated the value of a and R me d for the lognormal fit in each panel in Fig. 8. The shape 
of the size distribution is largely independent of mass for galaxies with M* < 3 x 1O 1O M and 
is well fit by a ~ 0.4 — 0.5. The median size R me d scales with stellar mass as R me d oc M°- 18 . 
For stellar masses larger than 3 x 1O 1O M , the shape of the size distribution is still lognormal, 
but the dispersion decreases. R me d also increases more rapidly with mass (oc M° 33 ) in this 
regime. 

Figs. 10 and 11 show the distributions of concentration index C and surface mass density 
yU* for these same eight stellar mass ranges. The distribution of C is nearly independent of 
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stellar mass for galaxies with M» < 3 x 1O 1O M . The median value of C for these systems 
is around 2.3. Only about 10% of low mass galaxies have C > 2.6, the value that marks 
the transition from late-type to early-type morphologies (Strateva et al 2001). At larger 
masses, the distribution shifts to progressively higher concentrations. In our highest mass bin 
90% of galaxies have C index values larger than 2.6. Fig. 11 shows that for galaxies with 
M, < 3 x 1O 1O M , the median surface mass density scales with stellar mass as //* oc Mf 63 . 
The //* distribution is fairly broad and its shape does not appear to depend very strongly on 
stellar mass. At values of M* larger than 3 x 1O 1O M , the scaling of the median value of /j,* 
with stellar mass becomes much weaker. The shape of the distribution function skews and is 
eventually strongly peaked at values of /i* around 1O 9 M kpc~ 2 . 

We note that all structural parameters discussed in this section have been calculated 
within circular apertures. We have estimated the additional scatter in size resulting from 
variations in axis-ratio and find that this has very small (< 10%) effect on the values of the 
dispersion a that we estimate. Because of seeing effects, R50(z) may be overestimated for 
galaxies with small angular sizes. We have split the galaxies in each stellar mass range into 
two equal samples according to redshift, and we have compared the size distribution of the 
nearby sample with that of the full sample. This is illustrated in two of the panels of Fig. 9. 
As can be seen, the effects appear to be quite small. In addition, we have checked the scalings 
using R90(z) and find that our conclusions remain unchanged. 

3.3 The Connection Between Star Formation History and Struc- 
tural Parameters 

In the previous two subsections we showed that both the star formation histories and the 
structural parameters of galaxies depend strongly on stellar mass. It is therefore not surprising 
that the star formation histories of galaxies correlate with /i* and C. This is illustrated in 
Fig. 12 where we present the conditional density distributions of the two stellar absorption 
indices D n (4000) and H5a as functions of surface mass density and of concentration. Galaxies 
with low surface density have low values of D„(4000) and galaxies with high surface density 
have high values of D n (4000), with a strong transition at /i* ~ 3 x 1O 8 M kpc~ 2 . There is 
also a striking transition in the values of D„(4000) and R5 A at C values around 2.6. We note 
that this corresponds very well to the value of C recommended by Strateva et al (2001) for 
optimum separation between early and late Hubble types, based on an analysis of a small 
sample of galaxies classified by eye. 

We now ask whether it is possible to pinpoint the primary cause of variations in the star 
formation history of galaxies - do galaxies have low values of D n (4000) and high values of RSa 
because their masses are small or because their surface mass densities are low? We attempt to 
address this question in Fig. 13 where we plot the fractions of galaxies with D„(4000)> 1.55 
and with H5a < 2.0 as a function of surface mass density in narrow ranges of stellar mass 
(left panels), and as a function of stellar mass in narrow ranges of surface mass density (right 
panels). We only plot these fractions for values of M* and ^ where there are more than 100 
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galaxies per bin. D n (4000) = 1.55 and H5 j4 =2 are chosen as natural places to divide the sample, 
because this is where clear transitions occur in the D n (4000)/C and H8a/C plots in Fig. 12. 
(Note that the C index is determined directly from the photometric data, independently of 
both stellar mass and surface mass density). 

Fig. 13 shows that the two indices are more fundamentally related to //* than to M*. The 
fraction of galaxies with D„(4000)> 1.55 or with < 2 is largely independent of stellar 
mass at a given value of //*, although the most massive galaxies with M* > lO n M clearly 
deviate in that they have larger values of D n (4000) and smaller values of H5a at given At 
given stellar mass, however, the fraction of galaxies with D n (4000) > 1.55 or with H5a < 2 is a 
strongly increasing function of //*. These results are in qualitative agreement with those of Bell 
& de Jong (2000), who studied trends in the optical and infrared colours of a sample of nearby 
spiral galaxies and found that the colours most sensitive to star formation history correlate 
best with surface density. It also fits in well with numerous studies that have demonstrated 
a clear correlation between star formation rate and gas surface density for samples of spirals 
(e.g. Kennicutt 1983; Wong & Blitz 2002). 

3.4 Summary 

In the previous subsections, we demonstrated that there are strong correlations between the 
star formation histories, stellar masses and structural parameters of galaxies. Here we provide 
a summary of our results: 

1. Low mass galaxies have low values of D„(4000) and high values of H5a, indicative of 
young stellar populations . High mass galaxies have high values of D n (4000) and low 
values of H5a, indicative of old stellar populations. A sharp transition from young to 
old occurs at a stellar mass of around 3 x 1O 1O M . 

2. Low mass galaxies have low surface mass densities and low concentrations. High mass 
galaxies have high surface mass densities and high concentrations. An abrupt change in 
the slopes of the //* — M* and C — M* relations occurs at a stellar mass of 3 x 1O 1O M . 

3. The size distribution function of galaxies less massive than 3 x 1O 1O M is well character- 
ized by a lognormal distribution with dispersion a = 0.4 — 0.5. Galaxies more massive 
than 3 x 1O 1O M also have a lognormal size distribution, but the dispersion is smaller. 

4. Below the transition mass the median size of galaxies increases as M°- 18 . Above the 
transition mass the increase is more rapid, R oc M°- 3 . 

5. Galaxies with low surface mass densities and low concentrations have D n (4000) and 
H5a values indicative of young stellar populations. Galaxies with high surface mass 
densities and high concentrations have D n (4000) and H5a values indicative of old stellar 
populations. The transition from young to old stellar populations occurs at //* ~ 3 x 
1O 8 M kpc" 2 and C ~ 2.6. 
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6. The star formation histories of low mass galaxies, as traced by D n (4000) and , 
appear to be more fundamentally related to surface mass density than to stellar mass. 

7. The fraction of galaxies that have experienced recent starbursts decreases strongly with 
increasing stellar mass (and increasing surface mass density). 

4 Interpretation and Discussion 

Our analysis has demonstrated that there is a sharp transition in the physical properties of 
galaxies at a stellar mass of ~ 3 x 10 10 M Q . Galaxies less massive than this have low surface 
mass densities, low concentration indices typical of disks and young stellar populations. More 
massive galaxies galaxies have high surface mass densities, high concentration indices typical 
of bulges, and predominantly old stellar populations. 

The size distribution of disk galaxies is of particular interest, because it can be compared 
directly to simple theoretical models. In a classic paper in 1980, Fall & Efstathiou considered 
the formation of a disk by the condensation of gas in a gravitationally dominant dark matter 
halo. If the gas initially has the same specific angular momentum as the dark matter and 
conserves its angular momentum during its contraction, then the characteristic radius of the 
disk will scale as 

R d OC XRhalo, (2) 

where Rhaio is the virial radius of the halo. The spin parameter A is defined as 

A = J\E\^G-'M~X\ (3) 

where E is the total energy of the halo, J is its total angular momentum and M is its total 
mass (see Mo, Mao & White (1998) for a detailed analytic model). 

N-body simulations show that in hierarchical clustering cosmologies, the distribution of 
A for dark matter halos has lognormal form with parameters which depend very weakly on 
cosmology or on halo mass (Barnes & Efstathiou 1987; Warren et al 1992; Cole & Lacey 1996; 
Lemson & Kauffmann 1999) with typical values X me d — 0.04 and <j\ ~ 0.5. The fact that for 
low mass galaxies the shape and width of the size distributions in Fig. 9 agree so well with 
this simple theory is truly remarkable. We note that our results are in qualitative agreement 
with those of Syer, Mao & Mo (1999), who find that the distribution of spin parameter A 
inferred from a sample of nearby disk galaxies is in excellent agreement with the predictions 
of cosmological simulations. On the other hand, our results appear to disagree with those of 
de Jong & Lacey (2000), who find that the distribution of disk sizes is narrower than predicted 
by simple models. We note that neither of these two studies addressed possible variations in 
the size distributions of disk galaxies as a function of luminosity or of mass. 

It is not obvious how the stellar mass of a galaxy should scale with the mass of its dark 
matter halo. For simplicity, one might assume that a fixed fraction of the baryonic mass ends 
up in disk stars, in which case M* oc R 3 halo and R d oc M* 1/3 (this assumes that baryons follow 
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the dark matter, at least initially). From Fig. 9, we find that median size increases more 
slowly than predicted. This means that the stellar mass fraction must increase with halo 
mass. 

We now derive the scaling between the stellar mass of a disk galaxy and the mass of its 
dark matter halo from the slope of the //* — M* relation. If we adopt the assumptions of the 
simple model, we can write 

H* oc M*IR\ oc eM halo /R 2 halo oc eM^ 3 D oc e 2 ' z M l J\ (4) 

where e oc M*/Mh a io is the efficiency with which the disk has converted the available gas into 
stars. The Fall & Efstathiou arguments are only directly relevant to the formation of disk 
galaxies, while the conditional density distributions shown in Fig. 9 are derived using all 
galaxies in the sample. At stellar masses greater than 1O 1O M , a large fraction of the galaxies 
are bulge-dominated systems. In order to clarify the scaling properties of our two principal 
classes of galaxy, we define two subsamples which eliminate most of the mixed systems: i) a 
sample of late-type galaxies with C < 2.6 and -D n (4000) < 1.55 (41312 galaxies); and ii) a 
sample of early-type galaxies with C > 2.6 and _D n (4000) > 1.55 (53119 galaxies). Fig. 12 
demonstrates that these cuts divide young, disk-dominated systems from old, bulge-dominated 
systems surprisingly cleanly. 

The /x* — M* relation of our late-type subsample is shown in the left panel of Fig. 14, while 
the right panel shows the corresponding relation for early-types. It is striking that the //* — M* 
relation for the late-type subsample can be described quite well by a single power-law. The 
//* — M* relation for early-type galaxies still appears to undergo a transition in slope. (Note 
that there are few late- type galaxies with stellar masses above 2 x lO n M and few early-types 
with M* below 1O 1O M .) Fig. 14 also shows that early-type galaxies of given stellar mass have 
higher stellar surface densities than late-type galaxies of the same mass. This difference is 
largest for the lower mass objects and disappears for the most massive systems in our samples. 

Fitting to the dominant part of the population of late-type galaxies (i.e. over the stellar 
mass range 10 8 - 10 n M Q ) , we find /j* oc Mf 535±a03 , so that e oc M°- 30±ao5 oc M°f o ±0m 
and Rd oc M°' 23±0 02 . The star formation efficiency for disk galaxies apparently increases with 
halo mass. Such an increase is expected in a model where supernova feedback controls the 
dynamics of the interstellar medium, (e.g. McKee & Ostriker 1977; Efstathiou 2000). In 
most feedback models, supernovae inhibit the formation of stars in low mass halos, leading to 
small values of M*/M ha i (e.g. White & Frenk 1991; Kauffmann, White & Guiderdoni 1993; 
Cole et al 1994). However, because we do not have any information about the gas fractions 
of the galaxies in our sample, we cannot say whether the suppression of star formation occurs 
because gas is expelled from the galaxy and from its halo, as is often assumed in these models, 
or because the conversion of gas into stars is slowed in low mass systems. It is interesting 
that Tremonti et al (2003) find the gas phase metallicity of low mass galaxies to increase with 
stellar mass as Z oc M°- 4 " 05 (depending on which metallicity indicator they use), suggesting 
a similar dependence of overall star formation efficiency on stellar mass. 

For early-type galaxies, we fit over the range 3 x 10 10 — 3 x 1O 12 M and obtain /i* oc 
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M°' 0±0 - 02 . If we assume that Rd oc Rhaio for these galaxies, this would imply that e oc MT , 
i.e. the the star formation efficiency for ellipticals decreases with halo mass. Clues as to 
why this is the case may come from recent Chandra and XMM observations, which show that 
cooling of gas to temperatures below 1 keV is suppressed at the centres of rich clusters, where 
the most massive ellipticals are located (e.g. Fabian et al 2001; Bohringer et al 2001). It seems 
very likely that the assembly of bulge-dominated galaxies involved chaotic processes such as 
mergers or inhomogenous, rapidly star-forming collapse, which, for most systems, must have 
completed well before the present day. Under these circumstances, it is not clear that our 
assumed scaling between galaxy radius and dark matter halo radius is valid. Even if the 
simple scaling laws were to apply, the halo sizes and surface densities relevant for early-type 
galaxies are likely to be defined at higher effective redshift than those relevant for late-type 
galaxies. This may explain why early-type galaxies in Figure 14 have higher surface densities 
than late-type galaxies of similar mass. The scatter in surface density (and size) is also smaller 
for early-type than for late-type galaxies, as is very evident in Figure 14 as well as in Figures 
8 and 9. 

We note that the Lyman Break Galaxies (LBGs) at high redshift can plausibly be identified 
with the progenitors of early-type galaxies. Pettini et al.(2001) have derived a mean effective 
dynamical mass of 1.3 xlO lo M within a mean half-light radius of 2.3 kpc for a sample 
of LBGs at z ~ 3. Shapley et al.(2001) and Papovich et al. (2001) have estimated the 
stellar massses of LBGs using their observed rest-frame ultraviolet and optical spectral energy 
distributions. These agree well with the dynamical mass estimates. Lyman Break Galaxies 
thus have characteristic surface mass densities of ~ 1O 9 M kpc -2 , quite similar to those of 
present-day early-type galaxies. Moreover, if the star-formation associated with the Lyman 
Break Galaxies is integrated from from z ~ 2 to 6, one finds that they could have produced 
most of the stars in present-day early-type galaxies (Steidel et al. 1999; Thompson et al 2001). 

In the framework of hierarchical models of galaxy formation, the shape of the size dis- 
tributions of low mass galaxies is easy to understand, but the trend of younger mean stellar 
ages with decreasing stellar mass is puzzling, since lower mass dark matter halos typically 
form at earlier epochs. If all the gas cooled off and formed stars when these halos were as- 
sembled, lower mass galaxies would have older stellar populations (e.g. Kauffmann, White 
& Guiderdoni 1993). We have seen, however, that non-gravitational processes such as super- 
nova feedback may play an important role in regulating the rate at which stars form in these 
systems. To explain the observations, these processes must slow the conversion of gas into 
stars so that low mass systems remain gas rich until the present day and have low overall 
efficiencies for turning gas into stars. We have seen that the star formation histories of low 
mass galaxies are correlated more strongly with surface density than with stellar mass. If 
feedback processes are able to prevent the gas in low suface density galaxies from dissipating 
to form dense molecular clouds, then their star formation timescales will be long. 

Conversely, it may appear puzzling that star formation has terminated in the majority of 
high mass galaxies, because the hierarchical model claims that these systems were assembled 
relatively late. In order to explain the observations, star formation must have been both 
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efficient and rapid in the progenitors of these systems, so that the final stages of their assembly 
involved rather little dissipation or star formation. In current models, galaxies that fall into 
groups and clusters are stripped of their reservoir of gas. As a result, their star formation 
rates decline and their colours redden (see for example Diaferio et al 2001). This is unlikely 
to explain why the majority of massive galaxies do not form stars at the present day, because 
many do not reside in rich groups and clusters. In such systems, gas must somehow be 
prevented from cooling and forming stars after the bulge has formed. Again, the suppressed 
cooling at the centres of observed galaxy clusters may offer a clue to the mechanism responsible 
for this. 

Finally, it will be important to understand the physical origin of the characteristic mass 
scale that is so strikingly imprinted on the galaxy population. The structural properties and 
formation paths of dark matter halos vary smoothly with mass. The abrupt transition in the 
stellar ages, star formation histories and structure of galaxies that we find at 3 x 1O 1O M can 
only be explained by star formation and/or feedback processes. We believe that this tension 
between simplified phenomenological models of galaxy evolution and the trends seen in the 
data will lead to new insight into the physical processes that regulate how galaxies form and 
evolve. We intend to address these issues in more detail in future work. 
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Figure 1: Conditional density distributions showing trends in the stellar age indicators D n (4000) 
and H5a as functions of the logarithm of stellar mass and of g-band absolute magnitude. Galaxies 
have been weighted by 1/V max and the bivariate distribution function has been normalized to a fixed 
number of galaxies in each bin of logM* or M(g). Here and in all subsequent contour plots, each 
contour represents a factor two change in density. 
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Figure 2: Histograms showing the fraction of galaxies as a function of D„(4000) in 8 different ranges 
of stellar mass. The numbers in the upper right corner of each panel list, from top to bottom, the 
5th, 25th, 50th, 75th and 95th percentiles of the distribution. The number in the lower right corner 
is the number of galaxies contributing to the histogram. 
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Figure 3: The bivariate distribution function of H5a and D n (4000) in 8 ranges of stellar mass. 
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Figure 4: Black solid symbols show the median value of as a function of D n (4000) in two 
different stellar mass ranges. The solid and dotted errorbars indicate the 25th to 75th and 5th to 
95th percentile ranges of the distribution of H5a- Coloured symbols indicate the locus occupied by 
galaxies with continuous star formation histories. In the left hand panel, we have plotted 25% solar 
models and in the right panel we plot solar metallicity models. 
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Figure 5: The solid histograms show the distributions of galaxies as a function of median value 
of the likelihood distribution of Fburst in 8 stellar mass ranges. The dotted histograms show the 
distributions of the median value of Fb urs t for the high-confidence bursty galaxies in each mass range. 
The numbers on each panel indicate the fraction of galaxies in each mass bin with Fburst 

(50%) > 

(upper) and Fft urst (2.5%) > (lower). 
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Figure 6: The distribution of Ha equivalent widths for galaxies with Fb urst (2.5%) > (dashed)) is 
compared to that for galaxies with -Ffe ur st(50%) = (solid) for four different ranges in the value of 
the D n (4000) index. The numbers listed in the panels indicate the number of objects in the bursty 
sample (top) and in the continuous sample (bottom). 
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Figure 7: The distribution of Ha equivalent widths for galaxies with Fb urst (2.5%) > (dashed)) is 
compared to that for galaxies with -Ffe urs t(50%) = (solid) for four different ranges in stellar mass. 
The numbers listed in the panels indicate the number of objects in the bursty sample (top) and in 
the continuous sample (bottom). 
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Figure 8: Conditional density distributions showing trends in the structural parameters (i*, H\i<i 
and C = i?90/i?50 as a function the logarithm of stellar mass and as a function of r-band absolute 
magnitude. Galaxies have been weighted by 1/V max and the bivariate distribution function has been 
normalized to a fixed number of galaxies in each bin of log M* and of r-band absolute magnitude. 
The line in the bottom left panel indicates the surface brightness completeness limit of the SDSS 
survey. 
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Figure 9: Solid histograms show the fraction of galaxies as a function of the natural logarithm of 
the half-light radius in the z-band (R50(z)) in 8 different ranges of stellar mass. The curves show 
lognormal fits to the distribution and the numbers in each panel are the best-fit values of a (top) and 
Rmed (bottom). The dotted histograms in the panels with 9 < logM* < 9.5 and 11 < logM* < 11.5 
are for the subsample of galaxies with V/V max < 0.5. 
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Figure 10: Histograms showing the distribution of galaxies as a function of C index for 8 different 
ranges of stellar mass. The numbers in each panel list, from top to bottom, the 5th, 25th, 50th, 75th 
and 95th percentiles of the distribution. 
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Figure 11: Histograms showing the distribution of galaxies as a function of log/x* for 8 different 
ranges in stellar mass. The numbers in each panel list, from top to bottom, the 5th, 25th, 50th, 75th 
and 95th percentiles of each distribution. 
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Figure 12: Conditional density distributions showing trends in the stellar age indicators D„(4000) 
and H5a as functions of the logarithm of the surface mass density ^* and of the concentration index 
C. 
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Figure 13: Top Left: The fraction of galaxies with D„(4000)> 1.55 as a function of log//* in fixed 
ranges in stellar mass. Solid, short-dashed, long-dashed and dotted lines are for galaxies with logM* 
in the ranges 8-9, 9-10, 10-11 and 11-12, respectively. Top Right: The fraction of galaxies with 
D n (4000)> 1.55 as a function of logM* in fixed ranges in stellar surface mass density. Solid, short- 
dashed, long-dashed and dotted lines are for galaxies with log//* in the ranges 7.0-7.8, 7.8-8.3, 8.3-8.8 
and 8.8-9.3, respectively. Bottom Left: The fraction of galaxies with USa < 2 as a function of log /z* 
in the same ranges in M*. Bottom Right: The fraction of galaxies with H5a < 2 as a function of 
log M* in the same ranges in /t* . 
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Figure 14: Conditional density distributions showing trends in /x* as a function the logarithm of 
stellar mass for galaxies with L> n (4000) < 1.55 and C < 2.6 (left) and with D n (4000) > 1.55 and 
C > 2.6(right). 
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